Many local texture features exist; they vary in the way they are implemented, and each algorithm tries to
improve performance. This paper presents a theoretically very simple and computationally effective
approach to face recognition. In our implementation the face image is divided into 3x3 sub-regions, from
which features are extracted using the Local Binary Pattern (LBP) over a window, a fuzzy membership
function and the central pixel. LBP features possess a texture-discriminative property and their
computational cost is very low. By utilising the information from the LBP, the membership function and the
central pixel, the limitations of traditional LBP are overcome. Benchmark databases, namely the ORL and
Sheffield databases, are used to evaluate the proposed features with an SVM classifier. For the proposed
approach, K-fold results and ROC curves are obtained and the results are compared.
1. INTRODUCTION

Face Recognition (FR) is an effortless task for humans, but it is a
difficult task for machines because of pose and illumination variation,
ageing and facial growth, to mention a few factors. Despite these
difficulties, the face carries more information than the iris,
fingerprint, gait, etc. FR has the advantage that a face can be captured
at a distance and covertly, whereas other biometric modalities such as
fingerprints, palm prints and knuckle prints require physical contact
with a sensor. FR is an active research area because of applications
such as surveillance, passports and ATMs. The problem with FR is the
high dimensionality of the feature space (original space). A
straightforward implementation in the original image space is
computationally costly. Therefore, feature extraction and feature
selection techniques are used. In the recent past many FR algorithms
have been implemented. These algorithms can be roughly classified into
three categories:


1. Feature-based matching methods, which deal with local features such
as the eyes, mouth and nose and with their statistics, which are fed
into the recognition system [14].

2. Appearance-based schemes, which make use of holistic texture
features [17].

3. Hybrid methods, which use both local and holistic features for
recognition. It is reported that the hybrid approach gives better
recognition results [26].

One implementation of the structure-based schemes is based on human
perception [24]. In this implementation, a set of geometric face
features, such as the eyes, nose and mouth corners, is extracted. The
features are selected based on specific facial points chosen by a human
expert. The positions of the different facial features form a feature
vector, which is the input to a structural classifier that identifies
the subject. Among the appearance-based schemes, two of the most popular
techniques are Principal Component Analysis (PCA) [20] and Linear
Discriminant Analysis (LDA) [27].

PCA: Principal Component Analysis is one of the holistic approaches. It
is a statistical approach based on unsupervised learning, in which the
eigenvectors are computed from the covariance matrix. PCA models the
linear variations in high-dimensional data. PCA preserves the global
structure of the image in the sense that the image can be completely
reconstructed with PCA. It achieves dimensionality reduction by
projecting the n-dimensional data onto an r-dimensional ($r \ll n$)
linear subspace. The objective is to find a set of mutually orthogonal
basis functions that capture the directions of maximum variance in the
data. Since PCA performs dimensionality reduction, it can also be used
for image compression. Many related holistic approaches can be found in
the literature, such as kernel PCA, 2D-PCA and Complex PCA.
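As an illustration of the projection described above, the following is a minimal NumPy sketch, not the implementation used in this work. The function name `pca_project`, the random data and the choice r = 2 are assumptions made only for the example.

```python
import numpy as np

def pca_project(X, r):
    """Project the rows of X (samples x n features) onto the top-r principal axes."""
    mean = X.mean(axis=0)
    Xc = X - mean                            # centre the data
    cov = np.cov(Xc, rowvar=False)           # n x n covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:r]    # indices of the r largest eigenvalues
    W = eigvecs[:, order]                    # n x r matrix with orthonormal columns
    return Xc @ W, W, mean

# Hypothetical data: 50 samples with n = 6 features, reduced to r = 2.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 6))
Y, W, mean = pca_project(X, r=2)
```

The columns of `W` are mutually orthogonal, and the first projected coordinate carries at least as much variance as the second, which matches the maximum-variance objective stated in the text.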

LDA: Linear Discriminant Analysis is a supervised learning method. The
Fisherfaces method is built on the concept of LDA. LDA builds the
projection axes in such a way that data points belonging to the same
class lie close together, while the projected points of different
classes lie far apart. LDA encodes the discriminating information in a
linearly separable space using bases which, unlike in PCA, are not
necessarily orthogonal. As face samples involve a high-dimensional image
space, the Eigenfaces approach is likely to find the wrong components,
which leads to poor recognition; on the other hand, Fisherfaces are
computationally expensive.

The ultimate aim of any feature extraction algorithm is to build highly
descriptive features. Ideally, the features should produce zero variance
within a class and infinite variance between classes.
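One common way to quantify this goal for a single scalar feature is the ratio of between-class scatter to within-class scatter. The sketch below is an illustration under that assumption; the function name `fisher_ratio` and the toy data are hypothetical and are not taken from this work.

```python
import numpy as np

def fisher_ratio(features, labels):
    """Between-class scatter divided by within-class scatter for a 1-D feature."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    overall_mean = features.mean()
    between = 0.0
    within = 0.0
    for c in np.unique(labels):
        x = features[labels == c]
        between += len(x) * (x.mean() - overall_mean) ** 2
        within += ((x - x.mean()) ** 2).sum()
    # Zero within-class scatter is the ideal case mentioned in the text.
    return between / within if within > 0 else float("inf")

# Toy data: the same values are well separated under one labelling, mixed under another.
values = [1.0, 1.1, 0.9, 5.0, 5.1, 4.9]
separated = fisher_ratio(values, [0, 0, 0, 1, 1, 1])
mixed = fisher_ratio(values, [0, 1, 0, 1, 0, 1])
```

A feature that groups each class tightly while keeping class means apart receives a large ratio, so `separated` is much larger than `mixed`.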

The rest of the paper is organized as follows. Section 2 describes the
basic LBP and its variants. Section 3 discusses fuzzy logic. In Section 4
the proposed method is presented. Section 5 describes the experimental
setup and results. Section 6 gives the conclusions of this paper.
Figure 1. The basic Local Binary Pattern (LBP) operator


2. BASIC LBP AND ITS VARIANTS 

In this section the basic LBP and its variants are briefly discussed.
The basic LBP was proposed in [15]. Among its variants, Tan and Triggs
[19] proposed the Local Ternary Pattern (LTP), which uses three levels
(+1, 0, -1) obtained by quantizing the difference between the gray value
of a central pixel and that of its neighbouring pixel. Other proposed
variants of LBP include derivative-based LBP [9], Sobel-LBP [25],
Uniform LBP [2], rotation-invariant LBP [5], multi-dimensional LBP [18],
dominant LBP [13], centre-symmetric LBP [21] and transition LBP [1].

2.1. Basic Local Binary Patterns 

The LBP concept has been applied to areas such as face recognition [1],
dynamic texture recognition [23] and shape localization [9]. The Local
Binary Pattern (LBP) method is widely used in 2D texture analysis. The
LBP operator is a non-parametric 3x3 kernel which describes the local
spatial structure of an image. It was first introduced by Ojala et al.
[15], who showed the high discriminative power of this operator for
texture classification. At a given pixel position $(x_c, y_c)$, LBP is
defined as an ordered set of binary comparisons of pixel intensities
between the centre pixel and its eight surrounding pixels. The resulting
8-bit word (LBP code) has $2^8$ possible values, which are called Local
Binary Patterns, abbreviated as LBP codes. The basic LBP operator uses a
fixed 3x3 neighbourhood, as in Fig. 1.

New Fuzzy LBP Features for Face Recognition

The LBP operator can be mathematically expressed as:

$$LBP(x_c, y_c) = \sum_{n=1}^{8} s(I_n - I_c)\, 2^{n-1} \qquad (1)$$

where $I_c$ corresponds to the gray value of the centre pixel
$(x_c, y_c)$, $I_n$ to the gray values of the 8 surrounding pixels, and
the function $s$ is defined as

$$s(x) = \begin{cases} 1, & \text{if } x \ge 0 \\ 0, & \text{if } x < 0 \end{cases} \qquad (2)$$
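The thresholding and weighting of a single 3x3 window can be sketched as follows. This is a minimal illustration: the clockwise neighbour ordering starting at the top-left corner and the weights $2^0$ to $2^7$ (so that the code fits in 8 bits) are assumptions, since the text fixes only that there are eight neighbours.

```python
import numpy as np

# Neighbour positions (row, col), visited clockwise from the top-left corner.
# The starting point and direction are assumptions made for this sketch.
OFFSETS = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]

def lbp_code(window):
    """Decimal LBP code of a 3x3 window, thresholded at the centre pixel."""
    window = np.asarray(window)
    center = window[1, 1]
    code = 0
    for n, (r, c) in enumerate(OFFSETS):
        if window[r, c] >= center:   # s(I_n - I_c) = 1 when the difference is >= 0
            code += 1 << n           # weight 2^n for n = 0..7
    return code

example = [[6, 5, 2],
           [7, 6, 1],
           [9, 8, 7]]
print(lbp_code(example))   # 241
```

For the example window the neighbours not smaller than the centre value 6 sit at positions 0, 4, 5, 6 and 7, giving 1 + 16 + 32 + 64 + 128 = 241.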


2.2. Uniform LBP 

Uniform LBP binary patterns, proposed by T. Ojala et al. [16], capture
certain fundamental properties of texture. A pattern is called uniform
if its circular bit string contains at most two transitions from zero to
one or from one to zero. Such patterns were observed to account for the
vast majority, around 90%, of the patterns occurring in texture images.
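The transition count can be computed with a circular bit rotation. The sketch below assumes the usual criterion of at most two 0/1 transitions in the circular 8-bit string; the function names are hypothetical.

```python
def transitions(code, bits=8):
    """Number of 0/1 changes in the circular bit string of an LBP code."""
    # Rotate right by one position so that each bit is aligned with its neighbour.
    rotated = (code >> 1) | ((code & 1) << (bits - 1))
    return bin(code ^ rotated).count("1")

def is_uniform(code, bits=8):
    """A pattern is treated as uniform when it has at most two transitions."""
    return transitions(code, bits) <= 2

uniform_codes = [c for c in range(256) if is_uniform(c)]
print(len(uniform_codes))   # 58
```

Of the 256 possible 8-bit codes, 58 satisfy this criterion, which is why uniform LBP histograms are much shorter than histograms of the basic LBP codes.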

2.3. Multi-block LBP (MB-LBP) 

In the basic LBP the central pixel is taken as the threshold and the
neighbouring pixels are assigned binary values. MB-LBP [22] adopts the
same procedure but takes a rectangular block as the reference. The
average value of this reference block acts as the threshold, and the
neighbouring blocks are assigned values based on this threshold.
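A minimal sketch of this block-based variant is given below. It assumes square blocks of equal size, a comparison of block averages against the central block average, and the same clockwise ordering convention as in a basic LBP sketch; these choices and the function name `mb_lbp_code` are assumptions for illustration.

```python
import numpy as np

# Block positions (row, col), visited clockwise from the top-left block (assumed order).
OFFSETS = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]

def mb_lbp_code(region, block):
    """MB-LBP code of a (3*block x 3*block) region built from block averages."""
    region = np.asarray(region, dtype=float)
    if region.shape != (3 * block, 3 * block):
        raise ValueError("region must contain 3x3 blocks of the given size")
    # Split into a 3x3 grid of blocks and average each block.
    means = region.reshape(3, block, 3, block).mean(axis=(1, 3))
    center = means[1, 1]             # average of the reference (central) block
    code = 0
    for n, (r, c) in enumerate(OFFSETS):
        if means[r, c] >= center:
            code |= 1 << n
    return code

# A region whose 2x2 blocks are constant reduces to the basic 3x3 LBP case.
base = np.array([[6, 5, 2], [7, 6, 1], [9, 8, 7]])
region = np.kron(base, np.ones((2, 2)))
print(mb_lbp_code(region, block=2))   # 241
```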

2.4. 3-Patch (3P-LBP) and 4-Patch (4P-LBP) LBP codes

3P-LBP [3] is obtained by comparing the values of three neighbouring
windows to produce a one-bit value. A 3 x 3 window centred on a pixel
and S additional windows distributed uniformly in a ring of radius r
around that pixel are considered. Pairs of windows are compared with the
centre patch, and a bit is set describing which of the two windows is
more similar to the centre patch. In 4P-LBP, two rings of radii $r_1$
and $r_2$ are considered. The code is then produced by comparing two
centre-symmetric windows in the inner ring with two centre-symmetric
windows in the outer ring positioned $\alpha$ patches away along the
circle.


2.5. LBP variance (LBPV) 

In LBP variance (LBPV) [5], the contrast of the image is used along with
the neighbouring points and radii to generate a joint distribution of
local patterns, which yields a dominant texture descriptor. LBPV
contains both local pattern and local contrast information. It is a
hybrid scheme that uses globally rotation-invariant matching with
locally variant LBP texture features. LBPV provides an efficient joint
LBP and contrast distribution, in which the variance is used as an
adaptive weight to adjust the contribution of each LBP code in the
histogram calculation.

In spite of the great success of LBP in pattern recognition, its
underlying working mechanism still needs more investigation. Even with
so many versions of LBP, open questions remain: which information is
missing from the LBP, and how can the missing information be represented
effectively in the LBP style so that better texture classification and
better recognition rates can be achieved?

Considering the limitations of the PCA and LDA techniques, we take
advantage of information sets, which were developed by Hanmandlu [6] to
enlarge the scope of fuzzy sets. The information set concept is simple
to understand. The image, after partitioning, is treated as the
information source. The information source values contained in a window
form a fuzzy set. The distribution of these information source values in
the fuzzy set requires an appropriate membership function to fit it. A
fuzzy set coupled with its membership function forms the information
set. We concentrate on extracting local fuzzy features from images,
instead of considering the complete image, which is a high-dimensional
vector. To extract simple and robust local features of an image, we use
the basic Local Binary Pattern (LBP), the information set and the
central pixel. These features are tested empirically with an SVM [4].

3. FUZZY LOGIC 

Fuzzy logic (FL) theory has been used extensively in many fields. It is
an extension of crisp set theory and helps to deal with imprecise and
uncertain data. It tackles the concept of partial truth. It was
introduced by Prof. Zadeh to model the vagueness and ambiguity in
complex systems for which there is no mathematical model. The limitation
of fuzzy sets is that the information source values and their membership
function values are treated separately in all problems dealt with by
fuzzy logic theory. The membership function value gives the degree of
association of a particular information source value with a fuzzy set
[6]. The proposed work seeks to combine the information source values
with their membership values; the combination is called an information
set. The fuzzifier developed by Hanmandlu et al. [8] has the scope to
enlarge the fuzzy set. As an example of an agent, consider a set of
students graded with the performance of the class topper as the
benchmark: each student's individual performance (information source
value) is assessed by comparing it with that of the topper, which gives
the membership value ($\mu$). Some of the popular membership functions
are given below.


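1. S Shaped Membership Function is a spline-based curve. The parameters
a and b locate the extremes of the sloped portion of the curve, as
given in equation (3). Fig. 2a illustrates the S shaped membership
function.

$$\mu(x; a, b) = \begin{cases} 0, & \text{if } x \le a \\ 2\left(\frac{x-a}{b-a}\right)^2, & \text{if } a \le x \le \frac{a+b}{2} \\ 1 - 2\left(\frac{x-b}{b-a}\right)^2, & \text{if } \frac{a+b}{2} \le x \le b \\ 1, & \text{if } x \ge b \end{cases} \qquad (3)$$

2. Z Shaped Membership Function is a spline-based function of x. The
parameters a and b locate the extremes of the sloped portion of the
curve, as given in equation (4). Fig. 2b illustrates the Z shaped
membership function.

$$\mu(x; a, b) = \begin{cases} 1, & \text{if } x \le a \\ 1 - 2\left(\frac{x-a}{b-a}\right)^2, & \text{if } a \le x \le \frac{a+b}{2} \\ 2\left(\frac{x-b}{b-a}\right)^2, & \text{if } \frac{a+b}{2} \le x \le b \\ 0, & \text{if } x \ge b \end{cases} \qquad (4)$$

3. Gaussian Membership Function. The symmetric Gaussian function depends
on two parameters $\sigma$ and $c$, as given in equation (5).

$$\mu(x; \sigma, c) = e^{-\frac{(x-c)^2}{2\sigma^2}} \qquad (5)$$

where $\sigma$ sets the spread ($\sigma^2$ is the variance) and $c$ is
the center. These two parameters define the shape of the Gaussian bell.
Fig. 2 illustrates the Gaussian membership function.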
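The three membership functions can be sketched in NumPy as follows. The sketch assumes the standard spline forms of the S and Z shaped functions, with the Z shape taken as the complement of the S shape; the function names `smf`, `zmf` and `gaussmf` are chosen for the example and are not part of this work.

```python
import numpy as np

def smf(x, a, b):
    """S shaped membership: 0 below a, 1 above b, quadratic splines in between."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    mid = (a + b) / 2.0
    y = np.zeros_like(x)
    lower = (x > a) & (x <= mid)     # first half of the slope
    upper = (x > mid) & (x < b)      # second half of the slope
    y[lower] = 2.0 * ((x[lower] - a) / (b - a)) ** 2
    y[upper] = 1.0 - 2.0 * ((x[upper] - b) / (b - a)) ** 2
    y[x >= b] = 1.0
    return y

def zmf(x, a, b):
    """Z shaped membership, the mirror image of the S shaped function."""
    return 1.0 - smf(x, a, b)

def gaussmf(x, sigma, c):
    """Symmetric Gaussian membership centred at c with spread sigma."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    return np.exp(-((x - c) ** 2) / (2.0 * sigma ** 2))

grid = [0.0, 0.25, 0.5, 0.75, 1.0]
print(smf(grid, 0.0, 1.0))   # [0.    0.125 0.5   0.875 1.   ]
```

With a = 0 and b = 1 the S shaped function rises from 0 to 1 and passes through 0.5 at the midpoint, while the Z shaped function falls over the same interval.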


4. PROPOSED METHOD 

Uncertainty appears in the gray values of images. Instead of considering
the whole image and its gray values, we divide the image into
non-overlapping sub-images of size 3x3 (each enclosed in a window) and
extract local information using the LBP method. We compute a membership
function for each window using the S-Membership Function, the
Z-Membership Function, the Gaussian Membership Function, the proposed
New Membership Function and the Root Mean Square features. For all the
features we use the central pixel to compute the final feature, because
the central pixel value is lost once the LBP feature is obtained. Let
$\mu_{ij}$ be the membership values of the window computed with any of
the above methods; for a 3x3 window the membership matrix is also 3x3.
The procedure is explained later in this section. Let $S$ be the 3x3
window in the image domain. Let $\psi$ be the information set obtained
from the window, that is, the element-by-element product of $S$ and
$\mu_{ij}$, denoted as $\psi = S \;.\!\times\; \mu_{ij}$, where
$.\times$ represents element-by-element multiplication. After the
element-by-element multiplication we get a 3x3 matrix, whose elements
we sum as given below.

$$\psi_w = \sum_{i=1}^{w} \psi_i \qquad (6)$$

where $\psi_w$ is the information set value obtained after summing the
elements $\psi_i$ of the product matrix. Once we obtain $\psi_w$, we
next compute the LBP code; an example is given in Fig. 1. In order to
take account of the information from the neighbourhood pixels
(information sources), we compute the LBP value for the window. This is
given by:

$$L_w = LBP(x_c, y_c) \qquad (7)$$

We convert the LBP code to its decimal value. Let the value of the LBP
from a window be denoted by $L_w$. The New membership function based
feature is given by

$$F_w = \psi_w \times L_w \times I_c \qquad (8)$$

where $I_c$ represents the centre pixel of the window. The New
membership function is computed from local structural details, namely
the average and the maximum of the window under consideration. The New
membership function is given by

( 9 )

where $I_{ij}$ represents the normalized intensity value, obtained by
dividing by the maximum in the image, $I_{avg}$ is the average of the
window under consideration and $I_{max}$ is the maximum of the window
under consideration. Algorithm 1 describes the steps involved in
computing the features with the various membership functions for a
given image.
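The window-level computation described in this section (membership values, information set value, LBP code and their product with the central pixel) can be sketched as follows. This is an illustration only: the Gaussian membership centred at the window mean is a stand-in, not the New membership function; cropping to a multiple of 3 replaces resizing; and the LBP neighbour ordering is assumed.

```python
import numpy as np

# Clockwise neighbour order starting at the top-left corner (an assumption).
OFFSETS = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]

def lbp_code(window):
    """Decimal LBP code of a 3x3 window, thresholded at the centre pixel."""
    center = window[1, 1]
    return sum(1 << n for n, (r, c) in enumerate(OFFSETS) if window[r, c] >= center)

def gaussian_membership(window):
    """Stand-in membership: Gaussian centred at the window mean (assumed parameters)."""
    c = window.mean()
    sigma = max(window.std(), 1e-6)          # floor avoids division by zero
    return np.exp(-((window - c) ** 2) / (2.0 * sigma ** 2))

def window_feature(window, membership=gaussian_membership):
    """Information set value times LBP code times central pixel."""
    window = np.asarray(window, dtype=float)
    mu = membership(window)                  # 3x3 membership values
    psi = float((window * mu).sum())         # element-wise product, then sum
    return psi * lbp_code(window) * window[1, 1]

def extract_features(image, membership=gaussian_membership):
    """One feature per non-overlapping 3x3 window of the normalized image."""
    image = np.asarray(image, dtype=float)
    image = image / image.max()              # normalize by the image maximum
    h, w = image.shape
    h3, w3 = h - h % 3, w - w % 3            # crop to multiples of 3 (assumption)
    return np.array([window_feature(image[r:r + 3, c:c + 3], membership)
                     for r in range(0, h3, 3) for c in range(0, w3, 3)])

rng = np.random.default_rng(0)
features = extract_features(rng.integers(1, 256, size=(10, 13)))
```

Passing a different `membership` callable (for example an S or Z shaped function applied to the window) changes only the information set value, which mirrors how the different features share the same LBP code and central pixel.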


Algorithm 1 Feature Extraction
Input: Face image I
Output: New membership function based feature vector for image I

 1: Normalize the image I by dividing each pixel value by the maximum
    intensity value
 2: Partition the image I into 3x3 windows, I = {w_1, ..., w_d}
 3: for k <- 1, d do
      // Compute the proposed new membership function values for w_k
 4:   I_c = w^k_{22}   // Central pixel value
 5:   for i <- 1, 3 do
 6:     for j <- 1, 3 do
 7:       Compute the membership value $\mu_{ij}$ using the selected
          membership function
 8:     end for
 9:   end for
10:   Compute the information set value $\psi_{w_k}$ using (6)
11:   Compute the LBP value $L_{w_k}$ using (7)
12:   Compute the feature using (8)
13:   // Store the feature in the feature vector
14: end for

4.1. The Root Mean Square Feature

This feature depends on the fuzzifier, the central pixel value and the
LBP code. The fuzzifier developed by Prof. Hanmandlu has the scope to
expand the fuzzy set. The fuzzifier in (10) was devised by Hanmandlu et
al. in [7]; it gives the spread of attribute values with respect to the
chosen reference (symbolized as ref). It is defined as

( 10 )

$I_{ref}$ can be taken as the average, minimum or maximum of the window
under consideration. It may be noted that the above fuzzifier gives more
spread than is possible with the variance as used in the Gaussian
function.

F_w = {\ln + g) x I_c x L_w ( 11 )

The contribution of this method is that it overcomes the shortcoming of
the LBP approach, which ignores the central pixel value: it considers
information from both the central and the neighbourhood pixels.

5. EXPERIMENTAL RESULTS 

The proposed method is simple and computationally efficient. A large
number of experiments conducted on the ORL and Sheffield databases show
the effectiveness of our method. The ORL database, from the Olivetti
Research Laboratory (ORL) in Cambridge, U.K., also known as the AT&T
database, has 400 images of 40 different subjects, with 10 images per
subject, each of size 112 x 92. The Sheffield database has 20 classes,
and the size of each image is approximately 220 x 220. It covers a wide
range of head orientations, with rotation of up to ±90 degrees about the
central axis. The 20 subjects of the Sheffield database are kept in
separate folders, and the number of images in each folder varies. In
both databases, we normalize all of the gray (pixel) values by dividing
them by the maximum gray value in that image, so the pixel values lie in
the range 0 to 1 (these are the $I_{ij}$, the normalized information
source values). For both databases we have resized the images to integer
multiples of 3 so as to avoid zero padding at the edges.

Figure 3. Sample images of benchmark databases: (a) ORL database images
of different subjects; (b) Sheffield database images of different
subjects
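The preprocessing arithmetic can be checked with a short sketch. Rounding each dimension down to the nearest multiple of 3 is an assumption made for the example (the text states only that the sizes are integer multiples of 3), and the helper names are hypothetical.

```python
import numpy as np

def normalize(image):
    """Scale gray values into [0, 1] by dividing by the image maximum."""
    image = np.asarray(image, dtype=float)
    return image / image.max()

def multiple_of_three(n):
    """Largest multiple of 3 that does not exceed n (assumed rounding rule)."""
    return n - n % 3

def window_count(height, width):
    """Number of non-overlapping 3x3 windows, i.e. features per image."""
    return (multiple_of_three(height) // 3) * (multiple_of_three(width) // 3)

print(window_count(63, 63))   # 441
print(window_count(90, 90))   # 900
```

A 63 x 63 image gives 21 x 21 = 441 windows and a 90 x 90 image gives 30 x 30 = 900 windows, one feature per window.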

The next step is to divide the image into non-overlapping 3x3 windows.
For each 3x3 window, we compute $\mu_{ij}$, perform the
element-by-element multiplication with $S$, and obtain the centre pixel
value $I_c$ of the window. We then compute the LBP value of the window
using (7). The total information (feature) is the product of the
information set, the central pixel and the decimal value obtained from
the neighbouring pixels. This total information forms one feature per
window. All features are stored as an array for the complete image. For
example, an image of size 63 x 63 yields an array of 441 features, and
for 400 images the features are stored as a 400 x 441 matrix. The
complete set is divided into a training set and a test set. We have used
the SVM classifier [4], which is available in PRTools. The first 50% of
the dataset is used for training and the remaining 50% for testing. For
the ORL database the recognition rates are given in Table 1. Good
performance is achieved by the New Membership Function for polynomial
kernels of degree 1, 2 and 3. Fuzzy logic gives flexibility in selecting
the membership function: one can either use a standard membership
function such as the Gaussian or sigmoid, or design one's own
membership function to suit the need.

K-fold is a validation technique in which the data is divided into K
approximately equal parts. Each time, one part is used for testing and
the remaining parts are used for training; the part that was used for
testing is then placed back into the training data. The procedure is
repeated K times, hence the name K-fold. The value of K is normally
taken as 10. For our implementation we have used the SVM classifier for
K-fold validation. Fig. 4 shows the K-fold recognition results on the
ORL database. The best performance is given by the Root Mean Square
(RMS) features, with an average recognition rate of 96%, followed by
the Gaussian Membership Function and the proposed New MF, with an
average recognition rate of 95.75%.

The receiver operating characteristic (ROC) curve is a graphical plot
that shows the performance of a classifier system as its discrimination
threshold is varied. For obtaining the ROC curves we have used the K-NN
classifier. Fig. 5 shows the ROC results of the recognition on the ORL
database. The recognition rates at a False Accept Rate (FAR) of 0.1 are
as follows: 55% with SMF, 52% with ZMF, 59% with Gaussian MF, 57% with
New MF and 54% with RMS features. The best performance is given by the
Gaussian MF and the lowest by the ZMF.

Table 1
Recognition rate (in %) for ORL database with SVM classifiers

ORL Database   SVM Poly1   SVM Poly2   K-Fold Results
                                       MIN     MAX     AVG
SMF            84          78.5        87.5    100     95.25
ZMF            82          79          87      100     94.50
Gauss MF       85          80          87.5    100     95.75
New MF         85.5        81          87.5    100     95.75
RMS            85          80.5        90      100     96

Figure 4. K-fold Recognition rates on ORL Database with 10 folds (plot
title: Accuracy vs K-fold on ORL Database)
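The K-fold procedure can be sketched as an index generator together with the MIN, MAX and AVG summary used in the tables. This is an illustrative sketch: the shuffling, the seed and the per-fold accuracies in the usage lines are hypothetical and are not results from this work.

```python
import numpy as np

def k_fold_indices(n_samples, k, seed=0):
    """Yield (train, test) index arrays; every sample is tested exactly once."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n_samples), k)
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train, test

def summarize(accuracies):
    """Minimum, maximum and average accuracy over the folds."""
    acc = np.asarray(accuracies, dtype=float)
    return acc.min(), acc.max(), acc.mean()

# 400 samples and K = 10 give 40 test samples and 360 training samples per fold.
splits = list(k_fold_indices(400, 10))

# Hypothetical per-fold accuracies, used only to show the summary format.
low, high, avg = summarize([90, 100, 95, 100, 100, 97.5, 92.5, 100, 95, 100])
```

A classifier would be trained on each `train` index set and scored on the matching `test` set; the scores are then passed to `summarize`.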

The experiment is also conducted on the Sheffield database. First the
images are resized to 90 x 90. After extracting the features for the
complete database, the database is divided into a training set and a
test set: the first 50% of the dataset is used for training and the
remaining 50% for testing. For the Sheffield database the recognition
rates are given in Table 2. Good performance is achieved by the RMS
features for polynomial degrees 1 and 2, and the second best performance
is given by the New MF. With RMS features the recognition rate is 97.14%
with SVM Poly1. The weakest performer is the S-MF, for which SVM Poly2
gives a recognition rate of 85.71%. K-fold validation is also performed
on the Sheffield database. Fig. 6 shows the K-fold recognition results
on the Sheffield database, and Table 2 gives the K-fold results in
tabular form. On this database the best performances are given by the
Z-MF, Gaussian MF and New MF, with an average over all ten folds of
100%, whereas the Root Mean Square (RMS) features and the features
obtained from the S-MF have a minimum recognition rate of 90%, a maximum
recognition rate of 100% and an average recognition rate of 99%. In this
case the New MF has worked well, and the RMS features are second best.
The receiver operating characteristic (ROC) curve is also plotted. For
obtaining the ROC curves we have used the K-NN classifier.
Figure 5. ROC Curves for various features on ORL database (horizontal
axis: False Accept Rate)
Table 2
Recognition rate (in %) for Sheffield database with SVM classifier

Sheffield Database   SVM Poly1   SVM Poly2   K-Fold Results
                                             MIN     MAX     AVG
SMF                  92.14       85.71       90      100     99
ZMF                  92.14       87.85       100     100     100
Gauss MF             95          92.14       100     100     100
New MF               95          92.14       100     100     100
RMS                  97.14       90.71       90      100     99


Fig. 7 shows the ROC results of the recognition on the Sheffield
database. The recognition rates at a FAR of 0.1 are as follows: 62% with
S-MF, 65% with Z-MF, 78% with both the Gaussian MF and the New MF, and
73% with RMS features. The best performance is given by the Gaussian MF
(matched by the New MF) and the lowest by the S-MF.
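Reading a recognition rate off a ROC curve at a fixed FAR can be sketched as follows. The sketch assumes that each comparison produces a similarity score that is accepted when it reaches a threshold; how scores are derived from the K-NN classifier is not specified in the text, and the scores below are made up for illustration.

```python
import numpy as np

def rate_at_far(genuine, impostor, far_target=0.1):
    """Highest true accept rate among thresholds whose FAR stays within far_target."""
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)
    best = 0.0
    # Every observed score is a candidate threshold; accept when score >= threshold.
    for t in np.unique(np.concatenate([genuine, impostor])):
        far = np.mean(impostor >= t)         # fraction of impostors accepted
        if far <= far_target:
            best = max(best, float(np.mean(genuine >= t)))
    return best

# Hypothetical similarity scores.
genuine_scores = [0.9, 0.8, 0.7, 0.5, 0.3]
impostor_scores = [0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.35, 0.4, 0.45, 0.6]
print(rate_at_far(genuine_scores, impostor_scores, far_target=0.1))   # 0.8
```

With these scores a threshold of 0.5 accepts one impostor out of ten (FAR 0.1) and four genuine comparisons out of five, so the rate at FAR 0.1 is 0.8.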

Comparisons with other works: Arindam Kar et al. [12] reported a
recognition rate on the ORL database of 80.5% with Principal Component
Analysis and 85% with Independent Component Analysis (ICA). In another
work [11], Arindam Kar et al. reported a recognition rate of 82.86% with
Principal Component Analysis. The author of [10] reported a recognition
rate of 63% with Locality Preserving Projection (LPP). All of these
recognition rates were obtained on the ORL database.

6. CONCLUSIONS 

In this research work, we have developed five different Fuzzy Local
Binary Patterns (Fuzzy LBP) as local descriptors for face recognition.
The features are designed with the aim of improving the recognition
rate. Our main contributions include the application of the information
set to LBP and its use for face recognition to produce highly
descriptive features. The information that is lost in the computation of
the basic LBP is now used in the computation of the final feature. In
order to generate the optimal number of features, only non-overlapping
windows are considered, and hence histogram computations are avoided.
The proposed approach is found to be effective on images with variations
in expression, illumination and pose. The experiments reveal that the
better the representation of the uncertainty, the better the recognition
rates. Future work can be extended to the use of type-2 fuzzy sets and
the design of a new classifier.

Figure 6. K-fold Recognition rates on Sheffield Database with 10 folds
(plot title: Accuracy vs K-fold)
Figure 7. ROC Curves for various features on Sheffield database (plot
title: ROC Curves on Sheffield Database; horizontal axis: False Accept
Rate)
Figure 2. Fuzzy Membership Functions